DeepSeek V4 Chinese Large Model Evaluation: Achieving the Glory of Domestic First Again!
In the evaluation of DeepSeek V4 Chinese large model, the Pro version regained the top position in the country with a score of 70.98, followed closely by the Flash version with 68.82 points. The evaluation covers six dimensions: mathematical reasoning, scientific reasoning, code generation, intelligent agent task planning, instruction following, and hallucination control, marking a new breakthrough in domestic open-source model technology.